ESTIMATING MULTIDIMENSIONAL PERSISTENT HOMOLOGY
THROUGH A FINITE SAMPLING 

NICCOLO CAVAZZA, MASSIMO FERRI, AND CLAUDIA LANDI 


Abstract. An exact computation of the persistent Betti numbers of a submanifold
X of a Euclidean space is possible only in a theoretical setting. In
practical situations, only a finite sample of X is available. We show that, under
suitable density conditions, it is possible to estimate the multidimensional
persistent Betti numbers of X from the ones of a union of balls centered on
the sample points; this even yields the exact value in restricted areas of the
domain.

Using these inequalities we improve a previous lower bound for the natural 
pseudodistance to assess dissimilarity between the shapes of two objects from 
a sampling of them. 

Similar inequalities are proved for the multidimensional persistent Betti
numbers of the ball union and those of a combinatorial description of it.


1. Introduction 

Persistent Topology is an innovative way of matching topology and geometry,
and it proves to be an effective mathematical tool in Pattern Recognition,
particularly in Shape Comparison [1]. This new research area is experiencing a
period of intense theoretical progress, particularly in the form of the multidimensional
persistent Betti numbers (PBNs; also called rank invariant in [3]). In order
to express its full potential for applications, it has to interface with the typical
environment of Computer Science: it must be possible to deal with a finite sampling
of the object of interest, and with combinatorial representations of it.

A predecessor of the PBNs, the size function (i.e. PBNs at degree zero), already
enjoys such a connection, in that it is possible to estimate it from a finite, sufficiently
dense sampling [?], and it is possible to simplify the computation by processing a
related graph [9]. Moreover, strict inequalities hold only in “blind strips”, i.e. in
the $\omega$-neighborhood of the discontinuity lines, where $\omega$ is the modulus of continuity
of the filtering (also called measuring) function. Out of the blind strips, the values
of the size function of the original object, of a ball covering of it, and of the related
graph coincide. As a consequence it is possible to estimate dissimilarity of the shape
of two objects by using size functions of two samplings.

The present paper extends this result to the PBNs of any degree, by using
an article by P. Niyogi, S. Smale, and S. Weinberger [22], for a ball covering of
the object X (a submanifold of a Euclidean space) with balls centered at
dense enough points of X or near X: Theorems 4.1 and 4.2. A combinatorial
representation of X, with the corresponding inequalities (Theorem 6.1), is based on
a construction by H. Edelsbrunner [13]. In Section 5 we use Theorem 4.1 to get
a lower bound for the natural pseudodistance between two objects of which only
finite samplings are given.

All results are provided for multidimensional filtering functions, although most
current applications just use monodimensional ones.

It should be noted that the same kind of problem has been addressed in [?] by
using the notion of Weak Feature Size. An inequality rather similar to ours of
Theorem 4.1, also extending the main result of [?], is exposed in the proof of the
Homology Inference Theorem in Section 4 of [8]. The progress represented by the
present paper consists in the stress on inequalities (and not just on the consequent
equalities), in the use of multidimensional filtering functions, in the fact that any
continuous function (not just distance) is considered, and in the estimate of
the natural pseudodistance.

Some simple examples illustrate the results. 


2. Preliminary results 

In this section, we report some results on the stability of the PBNs and some
topological properties of compact Riemannian submanifolds of $\mathbb{R}^m$. For basic
notions on homology and persistent homology we refer the reader to [15] and [14], for
classical properties of submanifolds to [19].

2.1. Multidimensional persistent Betti numbers. In this paper we will always
work with coefficients in a field $\mathbb{K}$, so that all homology modules are vector spaces.
First we define the following relation $\preceq$ (resp. $\prec$) in $\mathbb{R}^n$: if $u = (u_1,\ldots,u_n)$ and
$v = (v_1,\ldots,v_n)$, we write $u \preceq v$ (resp. $u \prec v$) if and only if $u_j \le v_j$ (resp. $u_j < v_j$)
for $j = 1,\ldots,n$. We also define $\Delta^+$ as the open set $\{(u,v) \in \mathbb{R}^n \times \mathbb{R}^n \mid u \prec v\}$.
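As an illustration, the two relations and the membership test for the domain of the PBNs can be sketched in a few lines of Python. The function names `preceq`, `prec` and `in_delta_plus` are ours and purely illustrative; points of the codomain of the filtering function are assumed to be given as tuples of floats.

```python
from typing import Sequence


def preceq(u: Sequence[float], v: Sequence[float]) -> bool:
    """Weak relation: every coordinate of u is <= the matching one of v."""
    return len(u) == len(v) and all(a <= b for a, b in zip(u, v))


def prec(u: Sequence[float], v: Sequence[float]) -> bool:
    """Strict relation: every coordinate of u is < the matching one of v."""
    return len(u) == len(v) and all(a < b for a, b in zip(u, v))


def in_delta_plus(u: Sequence[float], v: Sequence[float]) -> bool:
    """(u, v) belongs to the open set on which the PBNs are defined."""
    return prec(u, v)
```

Note that for n > 1 the order is only partial: the points (0, 3) and (1, 2) are not comparable in either direction.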

As usual, a topological space X is triangulable if there is a finite simplicial complex
whose underlying space is homeomorphic to X; for a submanifold of a Euclidean
space, it will mean that its triangulation can be extended to a domain containing it.
We use Čech homology because it guarantees some useful continuity properties [5].
For terms and concepts concerning this homology, we refer to [15].

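Persistent Betti numbers. Let X be a triangulable space and $f = (f_1,\ldots,f_n) :
X \to \mathbb{R}^n$ be a continuous function; f is called a filtering function. We denote by
$X\langle f \preceq u\rangle$ the lower level subset $\{p \in X \mid f_j(p) \le u_j,\ j = 1,\ldots,n\}$. Then, for each
$i \in \mathbb{Z}$, the i-th multidimensional persistent Betti numbers (briefly PBNs) function
of $(X,f)$ is $\beta_{(X,f,i)} : \Delta^+ \to \mathbb{N}$, defined as $\beta_{(X,f,i)}(u,v) = \dim \mathrm{Im}\,\iota^{u,v}$, with

$\iota^{u,v} : \check{H}_i(X\langle f \preceq u\rangle) \to \check{H}_i(X\langle f \preceq v\rangle)$

the homomorphism induced by the inclusion map of the sublevel set
$X\langle f \preceq u\rangle \subseteq X\langle f \preceq v\rangle$. Here $\check{H}_i$ denotes the i-th Čech homology module.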
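To make the definition concrete in the simplest setting, the following minimal sketch computes the PBN of degree 0 for a finite graph with a one-dimensional filtering function. The setting is an assumption of ours, not the construction of the paper: vertices carry the filtering values and an edge belongs to a sublevel set as soon as both its endpoints do. In degree 0 the rank of the map induced by inclusion is the number of connected components of the sublevel set at v that contain at least one vertex of the sublevel set at u. The name `pbn0` is hypothetical.

```python
def pbn0(values, edges, u, v):
    """Degree-0 persistent Betti number of a vertex-filtered graph at (u, v).

    values[k] is the filtering value of vertex k; edges is a list of index
    pairs. An edge is in the sublevel set at level v when both of its
    endpoints have value <= v (an assumption of this sketch).
    """
    parent = list(range(len(values)))

    def find(a):
        # Union-find root lookup with path halving.
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    # Build the connected components of the sublevel set at level v.
    for a, b in edges:
        if values[a] <= v and values[b] <= v:
            parent[find(a)] = find(b)

    # Count the components at level v that are reached from level u.
    return len({find(k) for k, val in enumerate(values) if val <= u})


# A path graph with five vertices.
values = [0, 2, 1, 3, 0.5]
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]
```

With these data, two components born below level 0.6 are still distinct at level 1.5 and have merged at level 3.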

Remark. When PBNs were considered in [8], the filtering functions were assumed
to be tame; in our case, since we need continuous maps, we refer to Section 2.2
of [5] for the relevant extension.

PBNs give us a way to analyze triangulable spaces through their homological 
properties. Then it is natural to introduce a distance for comparing them. This 
has been done, and through this distance it has been possible to prove stability of 
the PBNs under variations of the filtering function in the one-dimensional [8] and 
multidimensional case [5].

2.2. Topological properties of compact Riemannian submanifolds of $\mathbb{R}^m$.

As hinted in Section 1, we are interested in getting information on a submanifold of
$\mathbb{R}^m$ via a finite sampling of it and a related ball covering. To this goal, we now state
some properties of compact Riemannian submanifolds of $\mathbb{R}^m$, especially referred to
such an approximating covering. Definition 2.2 and Proposition 2.2 are due to P.
Niyogi, S. Smale, and S. Weinberger [22]. The main idea is that, under suitable
hypotheses, it is possible to get, from a sampling of a submanifold, a ball covering
whose union retracts on it.

First, notice that the spaces $\mathbb{R}^m$ and $\mathbb{R}^n$ play two different roles in our arguments:
the ambient space of our submanifolds (which will always be $\mathbb{R}^m$) is endowed with
the classical Euclidean norm and has no partial order relation on it. On the other
hand, the codomain of the filtering functions ($\mathbb{R}^n$ throughout) is endowed with the
max norm and with the partial order relation $\preceq$ as defined at the beginning of
Section 2.1.

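Modulus of continuity. Let $f : \mathbb{R}^m \to \mathbb{R}^n$ be a continuous function. Then, for
$\varepsilon \in \mathbb{R}^+$, the modulus of continuity $\Omega(\varepsilon)$ of f is:

$\Omega(\varepsilon) = \max_{j=1,\ldots,n} \sup\{\mathrm{abs}(f_j(p) - f_j(p')) \mid p, p' \in \mathbb{R}^m,\ \|p - p'\| \le \varepsilon\}.$

In other words $\Omega(\varepsilon)$ is the maximum over all moduli of continuity of the single
components of f.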
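As a rough numerical companion to this definition, one can estimate the modulus of continuity from below by restricting the supremum to a finite set of sample points. The sketch below does exactly this; it is only a lower estimate of the true value, and the function name `empirical_modulus`, the test function and the grid are illustrative assumptions of ours.

```python
import itertools
import math


def empirical_modulus(f, points, eps):
    """Largest componentwise |f_j(p) - f_j(q)| over sampled pairs with
    Euclidean distance ||p - q|| <= eps.

    This is a lower estimate of the modulus of continuity at eps, since
    only finitely many pairs of points are inspected.
    """
    best = 0.0
    for p, q in itertools.combinations(points, 2):
        if math.dist(p, q) <= eps:
            fp, fq = f(p), f(q)
            best = max(best, max(abs(a - b) for a, b in zip(fp, fq)))
    return best


def example_f(p):
    # Two components: |y| (1-Lipschitz) and x/2 (1/2-Lipschitz).
    return (abs(p[1]), 0.5 * p[0])


# Regular grid of step 0.25 on the square [-1, 1] x [-1, 1].
grid = [(i * 0.25, j * 0.25) for i in range(-4, 5) for j in range(-4, 5)]
```

For this Lipschitz example the estimate at eps = 0.5 coincides with the exact value 0.5, which is attained by the first component.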

For a given compact Riemannian submanifold X of $\mathbb{R}^m$, the normal space of X
at a point $p \in X$ is the vector subspace $N_pX$ of the tangent space of $\mathbb{R}^m$ at p,
$T_p\mathbb{R}^m$, formed by the vectors orthogonal to $T_pX$, the tangent space of X at p. The
open normal bundle to X of radius s is defined as the subset of $T\mathbb{R}^m$

$\{(p,v) \in T\mathbb{R}^m \mid p \in X,\ v \in N_pX,\ \|v\| < s\}.$

By Theorem 10.19 of [20] and by compactness, there exists an embedding of the
open normal bundle to X of radius s into $\mathbb{R}^m$ for some s. Its image $\mathrm{Tub}_sX$ is called
a tubular neighborhood of X.

A condition number $1/\tau$ is associated with any compact Riemannian submanifold
X of $\mathbb{R}^m$.

Definition. $\tau$ is the largest number such that every open normal bundle B about
X of radius s is embedded in $\mathbb{R}^m$ for $s < \tau$.


Proposition (Prop. 3.1 of [22]). Let X be a compact Riemannian submanifold
of $\mathbb{R}^m$. Let $L = \{l_1,\ldots,l_k\}$ be a collection of points of X, and let
$U = \bigcup_{j=1,\ldots,k} B(l_j,\delta)$ be the union of balls of $\mathbb{R}^m$ with center at the points of
L and radius $\delta$. Now, if L is such that for every point $p \in X$ there exists an $l_j \in L$
such that $\|p - l_j\| < \frac{\delta}{2}$, then, for every $\delta < \sqrt{3/5}\,\tau$,
X is a deformation retract of U.

So they have the same homology.
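The hypotheses of this proposition are easy to check numerically in the simplest case of equally spaced samples on a circle. The sketch below assumes the bounds as they are stated in Prop. 3.1 of [22] (radius smaller than sqrt(3/5) times tau, and sampling dense to within half the radius), and uses the fact that tau equals the radius for a circle. The function name and its parameters are hypothetical.

```python
import math


def circle_sampling_ok(radius, k, delta):
    """Check the two hypotheses for k equally spaced points on a circle.

    For a circle, tau is the radius. The point of the circle farthest from
    the sample lies midway between two neighbouring sample points, at
    chordal distance 2 * radius * sin(pi / (2k)) from each of them.
    """
    tau = radius
    covering_radius = 2 * radius * math.sin(math.pi / (2 * k))
    radius_ok = delta < math.sqrt(3 / 5) * tau
    dense_enough = covering_radius < delta / 2
    return radius_ok and dense_enough
```

For instance, 64 equally spaced points on a circle of radius 4 with balls of radius 0.5 pass the check, while 16 points do not.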


Remark. The proof of Proposition 2.2 gives us a way to construct a retraction
$\pi : U \to X$ and a homotopy $F : U \times I \to U$ such that $F(q,0) = q$ and $F(q,1) = \pi(q)$.
Let $\pi_0 : \mathrm{Tub}_\tau X \to X$ be the canonical projection from the tubular neighborhood of
radius $\tau$ of X onto X. Then $\pi$ is the restriction of $\pi_0$ to U, for which it holds:

$\pi(q) = \arg\min_{p \in X} \|q - p\|.$

Then the homotopy is given by

$F(q,t) = (1-t)\,q + t\,\pi(q).$

It is also important to observe that the retraction $\pi$ moves the points of U less
than $\delta$; this is because the path from q to $\pi(q)$ always remains inside a ball of U that
contains q (q can be contained in the intersection of different balls), for every $q \in U$.
In fact $\pi^{-1}(q) = U \cap \mathrm{Tan}_q^{\perp} \cap B_\tau(q)$ (for a complete argument we refer to Section 4
of [22]).


3. Retracts 

The aim of this Section is to yield two rather general results, which will be specialized
to Theorem 4.1 and Lemma 6.1. Throughout this Section, Y will be a compact
Riemannian (hence triangulable) submanifold of $\mathbb{R}^m$ and V will be a compact,
triangulable subspace of $\mathbb{R}^m$ such that Y is a deformation retract of V, with retraction
r and homotopy $G : V \times I \to V$ from the identity of V, $1_V$, to r. Moreover, $\forall y \in Y$,
$\forall v \in r^{-1}(y)$, $\forall t \in I$ we assume that $(r \circ G)(v,t) = y$.

Let also $f : \mathbb{R}^m \to \mathbb{R}^n$ be a continuous function, and $f_Y$ and $f_V$ be the restrictions
of f to Y and V respectively.

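Lemma. $Y\langle f_Y \preceq x\rangle$ is a deformation retract of $V\langle f_Y \circ r \preceq x\rangle$.

Proof. Let $r_x : V\langle f_Y \circ r \preceq x\rangle \to Y\langle f_Y \preceq x\rangle$ be the restriction of r to $V\langle f_Y \circ r \preceq x\rangle$.
It is well-defined since, by the definition of the two sets, $r_x(V\langle f_Y \circ r \preceq x\rangle) \subseteq
Y\langle f_Y \preceq x\rangle$. We now set $G_x : V\langle f_Y \circ r \preceq x\rangle \times I \to V\langle f_Y \circ r \preceq x\rangle$ as the restriction
of G to $V\langle f_Y \circ r \preceq x\rangle \times I$. This restriction is well-defined, because the path from
v to $r_x(v)$ is all contained in $V\langle f_Y \circ r \preceq x\rangle$, thanks to the assumptions on G and
r. Moreover, it is continuous and for every $v \in V\langle f_Y \circ r \preceq x\rangle$, $G_x(v,0) = v$ and
$G_x(v,1) = r_x(v)$. So it is the sought deformation retraction. □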

Remark. Since the homotopy G is relative to Y (i.e. keeps the points of Y fixed
throughout), this is what is called a strong deformation retract in [15].

Now let $\varepsilon = \max_{v \in V} \|r(v) - v\|$ and $\omega(\varepsilon) = (\Omega(\varepsilon),\ldots,\Omega(\varepsilon)) \in \mathbb{R}^n$, where $\Omega$ is the
modulus of continuity of f (Def. 2.2). For sake of simplicity, set now $\beta_V = \beta_{(V,f_V,i)}$,
$\beta_Y = \beta_{(Y,f_Y,i)}$.

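Lemma. If $(u,v)$ is a point of $\Delta^+$ and if $u + \omega(\varepsilon) \prec v - \omega(\varepsilon)$, then

$\beta_V(u - \omega(\varepsilon), v + \omega(\varepsilon)) \le \beta_Y(u,v) \le \beta_V(u + \omega(\varepsilon), v - \omega(\varepsilon)).$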

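Proof. For $\omega = \omega(\varepsilon)$ such that $u + \omega \prec v - \omega$, write $V_x = V\langle f_V \preceq x\rangle$ and
$Y_x = Y\langle f_Y \preceq x\rangle$; we can consider the following diagram

(1)   $V_{u-\omega} \xrightarrow{\ \alpha\ } V_{u+\omega} \xrightarrow{\ \gamma\ } V_{v-\omega} \xrightarrow{\ \delta\ } V_{v+\omega}$,
      $\xi : V_{u-\omega} \to Y_u$,  $\theta : Y_u \to V_{u+\omega}$,  $\eta : Y_u \to Y_v$,  $\zeta : V_{v-\omega} \to Y_v$,  $\iota : Y_v \to V_{v+\omega}$,

where the maps $\alpha, \gamma, \delta, \eta, \theta, \iota$ are inclusions, $\xi$ is the composition of $r_u$, where
$r_x$ is as in the proof of Lemma 3.1 with $x = u$, with the inclusion $i_u$ of $V_{u-\omega}$ into
$V\langle f_Y \circ r \preceq u\rangle$, and $\zeta$ is similarly defined as the composition of $r_v$ with the inclusion
$i_v$ of $V_{v-\omega}$ into $V\langle f_Y \circ r \preceq v\rangle$. Let us observe that, passing to homology and using
the symbol $*$ to denote the induced homomorphisms, the dimension of the image
of $\delta_* \circ \gamma_* \circ \alpha_*$ is precisely $\beta_V(u - \omega, v + \omega)$, that of $\eta_*$ is $\beta_Y(u,v)$, and that of $\gamma_*$ is
$\beta_V(u + \omega, v - \omega)$.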

The first claimed inequality will follow from the commutativity of the large
trapezoid in diagram (1) up to homotopy. Indeed, if $\delta \circ \gamma \circ \alpha$ is homotopic to
$\iota \circ \eta \circ \xi$, then, passing to homology, the map $\delta_* \circ \gamma_* \circ \alpha_*$ is equal to the map
$\iota_* \circ \eta_* \circ \xi_*$. As a consequence the dimension of the image of $\delta_* \circ \gamma_* \circ \alpha_*$ is not
greater than that of $\eta_*$. Moreover, the second claimed inequality will follow from
the commutativity of the small trapezoid because, in this case, passing to homology
we have $\eta_* = \zeta_* \circ \gamma_* \circ \theta_*$. Therefore, the dimension of the image of $\eta_*$ is not greater
than that of $\gamma_*$.

We begin by proving that the small trapezoid commutes exactly. We observe that
$\zeta$ is the identity map on the points of Y. Since $Y_u \subseteq Y$, we have that $\zeta \circ \gamma \circ \theta$ is
the canonical inclusion of $Y_u$ in $Y_v$.

To prove the commutativity of the large trapezoid up to homotopy, using the
exact commutativity of the small trapezoid, it is sufficient to prove that the two
triangles in diagram (1) commute up to homotopy. As for the left triangle, let
$\tilde{G} : V_{u-\omega} \times I \to V_{u+\omega}$ be the composition $\tilde{G} = G_u \circ (i_u \times 1_I)$, where $G_x$ is as in the
proof of Lemma 3.1. Now, for every $v \in V_{u-\omega}$, we have $\tilde{G}(v,0) = G(v,0) = v = \alpha(v)$
and $\tilde{G}(v,1) = G(v,1) = r(v) = \xi(v) = \theta \circ \xi(v)$. Hence $\alpha$ is homotopic to $\theta \circ \xi$. The
proof that $\delta$ is homotopic to $\iota \circ \zeta$ is analogous.

□ 
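In computational terms, the lemma just proved brackets the PBNs of Y between two evaluations of the PBNs of V. A minimal sketch for the case of a one-dimensional filtering function follows; `beta_V` is assumed to be any callable returning the PBN of V at a point, and both `estimate_pbn` and the toy step function are hypothetical names and data of ours, chosen only to exercise the inequality.

```python
def estimate_pbn(beta_V, u, v, omega):
    """Return (lower, upper) bounds for beta_Y(u, v), one-dimensional case.

    The bounds are beta_V(u - omega, v + omega) and
    beta_V(u + omega, v - omega); when they coincide, beta_Y(u, v)
    is determined exactly.
    """
    if not u + omega < v - omega:
        raise ValueError("the lemma requires u + omega < v - omega")
    lower = beta_V(u - omega, v + omega)
    upper = beta_V(u + omega, v - omega)
    return lower, upper


def toy_beta_V(u, v):
    # Toy integer-valued step function standing in for the PBNs of V:
    # two classes born at level 0 that merge at level 3.5.
    if u < 0:
        return 0
    return 2 if v < 3.5 else 1
```

Far from the jump at level 3.5 the two bounds agree, so the value is exact; close to it they differ and only an estimate is available.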


Remark. The diagram of the previous proof, setting $v = u + \omega$, shows that the
persistence modules of Y and V are $\omega$-interleaved [21] (strongly $\omega$-interleaved, in
the terminology of [6]), so that their interleaving distance is $\le \omega$.

Lemma. If $(u,v)$ is a point of $\Delta^+$ such that $u + \omega(\varepsilon) \prec v - \omega(\varepsilon)$ and

$\beta_V(u - \omega(\varepsilon), v + \omega(\varepsilon)) = \beta_V(u + \omega(\varepsilon), v - \omega(\varepsilon))$

then

$\beta_Y(u,v) = \beta_V(u,v).$

Proof. Straightforward from the preceding lemma and from the fact that $\beta_V$ is non-decreasing
in the first variable and non-increasing in the second one. □


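The next Lemma shows that there may be whole regions (hyperparallelepipeds)
where the PBNs of Y and V are known to agree.

Lemma. Let $\bar{u}, u, u', \bar{v}, v, v' \in \mathbb{R}^n$ be such that $\bar{u} \preceq u - \omega(\varepsilon)$, $u + \omega(\varepsilon) \preceq u' \prec \bar{v} \preceq
v - \omega(\varepsilon)$, $v + \omega(\varepsilon) \preceq v'$. If

$\beta_V(\bar{u}, v') = \beta_V(u', \bar{v})$

then

$\beta_Y(u,v) = \beta_V(u,v).$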


Proof. Since $\beta_V$ is non-decreasing in the first variable and non-increasing in the
second one, we have, with $\omega = \omega(\varepsilon)$,

$\beta_V(\bar{u}, v') \le \beta_V(\bar{u}, v + \omega) \le \beta_V(u - \omega, v + \omega) \le$
$\le \beta_V(u + \omega, v - \omega) \le \beta_V(u + \omega, \bar{v}) \le \beta_V(u', \bar{v})$

whence, by the hypothesis $\beta_V(\bar{u}, v') = \beta_V(u', \bar{v})$, we get $\beta_V(u - \omega, v + \omega) = \beta_V(u +
\omega, v - \omega)$. The preceding lemma then yields the thesis. □

As a consequence of the preceding lemmas, the regions of $\Delta^+$ where $\beta_Y$ and
$\beta_V$ may possibly disagree can be precisely localized in a neighborhood of the
discontinuity set of $\beta_V$ (viewed as an integer function). In the case n = 1, where
the discontinuity sets are proved to be (possibly infinite) line segments [17], these
regions are called blind strips. We stress that the position of the blind strips is
well-known, since it is determined by the position of the discontinuity lines of the
PBNs of V.

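Lemma. If $\hat{u}, \hat{v} \in \mathbb{R}^n$ are such that

$\beta_Y(\hat{u},\hat{v}) \ne \beta_V(\hat{u},\hat{v})$

then there is at least a point $(u,v)$ with max norm $\|(u,v) - (\hat{u},\hat{v})\| \le \Omega(\varepsilon)$, which
is either an element of the boundary of $\Delta^+$ or a discontinuity point of $\beta_V$.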

Proof. We prove the contrapositive.

Let $(u,v) \in \mathbb{R}^n \times \mathbb{R}^n$ be a point of $\Delta^+$. Let D be the closed ball of radius $\Omega(\varepsilon)$ centered
at $(u,v)$. Set $\omega = \omega(\varepsilon)$, $\bar{u} = u - \omega$, $u' = u + \omega$, $\bar{v} = v - \omega$, $v' = v + \omega$; D is the
closed hypercube $\{(w,z) \in \mathbb{R}^n \times \mathbb{R}^n \mid (\bar{u},\bar{v}) \preceq (w,z) \preceq (u',v')\}$; its boundary contains
$(\bar{u}, v')$ and $(u', \bar{v})$.

Now, assume that $(u,v)$ is at distance greater than $\Omega(\varepsilon)$ from the boundary of $\Delta^+$
(with the max norm distance). Then the point $(u', \bar{v})$, belonging to D, is contained
in $\Delta^+$. This means that $u' \prec \bar{v}$.

Moreover, assume that $(u,v)$ is at distance greater than $\Omega(\varepsilon)$ from any discontinuity
point of $\beta_V$. Then there is an open subset of $\Delta^+$, containing D, on whose points
$\beta_V$ is continuous. Since the range of $\beta_V$ has the discrete topology, this implies that
$\beta_V$ is constant on this whole open set, hence also on D. Then $\beta_V(\bar{u}, v') = \beta_V(u', \bar{v})$;
therefore $\beta_Y(u,v) = \beta_V(u,v)$ by the preceding lemma. □

4. Ball coverings 

Throughout this Section, X will be a compact Riemannian (triangulable)
submanifold of $\mathbb{R}^m$. As hinted in Section 1, we want to get information on X out of
a finite set of points. First, the points will be sampled on X itself, then even in a
(narrow) neighborhood. In both cases, the idea is to consider a covering of X made
of balls centered on the sampling points.

What we get is a double inequality which yields an estimate of the PBNs of X
within a fixed distance from the discontinuity sets of the PBNs (meant as integer
functions on $\Delta^+$) of the union U of the balls of the covering; moreover, the lemmas
of Section 3 even offer the exact value of it at points sufficiently far from the
discontinuity sets.

4.1. Points on X. Let $\delta < \sqrt{3/5}\,\tau$ and let $L = \{l_1,\ldots,l_k\}$ be a set of points of
X such that for every $p \in X$ there exists an $l_j \in L$ for which $\|p - l_j\| < \frac{\delta}{2}$. Let
U be the union of the balls $B(l_j,\delta)$ of radius $\delta$ centered at $l_j$, $j = 1,\ldots,k$. So
all conditions of Proposition 2.2 are satisfied. As before, set $\beta_U = \beta_{(U,f_U,i)}$,
$\beta_X = \beta_{(X,f_X,i)}$.


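Theorem. If $(u,v)$ is a point of $\Delta^+$ and if $u + \omega(\delta) \prec v - \omega(\delta)$, where $\omega(\delta) =
(\Omega(\delta),\ldots,\Omega(\delta)) \in \mathbb{R}^n$, then

$\beta_U(u - \omega(\delta), v + \omega(\delta)) \le \beta_X(u,v) \le \beta_U(u + \omega(\delta), v - \omega(\delta)).$

If $\hat{u}, \hat{v} \in \mathbb{R}^n$ are such that

$\beta_X(\hat{u},\hat{v}) \ne \beta_U(\hat{u},\hat{v})$

then there is at least a point $(u,v)$ with max norm $\|(u,v) - (\hat{u},\hat{v})\| \le \Omega(\delta)$, which
is either an element of the boundary of $\Delta^+$ or a discontinuity point of $\beta_U$.

Proof. By the lemmas of Section 3 (the double inequality and the localization of
the points of disagreement), with Y = X, V = U. □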


4.2. Points near X. So far we have approximated X by points picked up on
X itself, but it is also possible to choose the points near X, by respecting some
constraints. Once more, this is possible thanks to a result of [22].

Proposition (Prop. 7.1 of [22]). Let $L = \{l_1,\ldots,l_k\}$ be a set of points in the
tubular neighborhood of radius s around X and $U = \bigcup_{j=1,\ldots,k} B(l_j,\delta)$ be the union
of the balls of $\mathbb{R}^m$ centered at the points of L and with radius $\delta$. If for every point
$p \in X$ there exists a point $l_j \in L$ such that $\|p - l_j\| < s$, then X is a deformation retract
of U, for all $s < (\sqrt{9} - \sqrt{8})\,\tau$ and
$\delta \in \left( \frac{(s+\tau) - \sqrt{s^2 + \tau^2 - 6s\tau}}{2},\ \frac{(s+\tau) + \sqrt{s^2 + \tau^2 - 6s\tau}}{2} \right)$.

Then, as with Theorem 4.1, we have, with an analogous proof:

Theorem. Under the hypotheses of Proposition 4.2, if $(u,v)$ is a point of $\Delta^+$ and
if $u + \omega(\delta + s) \prec v - \omega(\delta + s)$, where $\omega(\delta + s) = (\Omega(\delta + s),\ldots,\Omega(\delta + s)) \in \mathbb{R}^n$, then

$\beta_U(u - \omega(\delta + s), v + \omega(\delta + s)) \le \beta_X(u,v) \le \beta_U(u + \omega(\delta + s), v - \omega(\delta + s)).$

If $\hat{u}, \hat{v} \in \mathbb{R}^n$ are such that

$\beta_X(\hat{u},\hat{v}) \ne \beta_U(\hat{u},\hat{v})$

then there is at least a point $(u,v)$ with max norm $\|(u,v) - (\hat{u},\hat{v})\| \le \Omega(\delta + s)$, which
is either an element of the boundary of $\Delta^+$ or a discontinuity point of $\beta_U$.


4.3. The 1D case. We now show how Theorem 4.1 can be used for applications. In
the case of filtering functions with one-dimensional range, and in the 1D reduction
of Section 2.1 of [2] and Section 4 of [5], the subsets of $\mathbb{R}^2$ where the PBN functions
are not continuous form (possibly infinite) line segments [17]. We recall that, by
the last lemma of Section 3, the blind strips, i.e. the regions where the equality
$\beta_Y(u,v) = \beta_V(u,v)$ is not granted, are $2\Omega(\varepsilon)$ wide strips around such segments.

In the case of ball coverings as in Section 4.1, the width of the blind strips is
a representation of the approximation error, in that it is directly related to $\Omega(\delta)$,
where $1/\delta$ represents the density of the sampling.

Let X be a circle of radius 4 in $\mathbb{R}^2$ (Figure 1); we observe that $\tau$ is exactly the
radius of X, so $\tau = 4$. In order to create a well defined approximation we need that
$\delta < \sqrt{3/5}\,\tau = 4\sqrt{3/5}$.

In the first example we have taken $\delta = 0.5$. Now, to satisfy the hypothesis of
Theorem 4.1 (that for every $p \in X$ there exists an $l_j \in L$ such that $\|p - l_j\| < \frac{\delta}{2}$),
we have chosen 64 points $l_j$ on X. Moreover we have sampled X uniformly, so that
there is a point every $\frac{\pi}{32}$ radians (Figure 2). We stick to the monodimensional case,
choosing $f : \mathbb{R}^2 \to \mathbb{R}$, with $f(x,y) = \mathrm{abs}(y)$. U is the resulting ball union.

Figures 3 and 4 represent the PBN functions at degree zero of X and U
respectively. $\Delta^+$ is the half-plane above the diagonal line, and the numbers are the values
of the PBNs in the triangular regions they are written in. In Figure 3 there is only
one big triangle where the value 2 signals the two different connected components
generated by $f_X$. The two connected components collapse to one at value 4. In
Figure 4 there is also a big triangle representing the two connected components, but
they collapse at value 3.53106. Moreover there are 4 other very small triangles near
the diagonal, representing more connected components generated by the
approximation error. In the last figure (Figure 5) the blind strips around the discontinuity
lines of $\beta_{(U,f_U,0)}$ are shown. The width of these strips, since $\Omega(\delta) = 0.5$, is equal to
$2\Omega(\delta) = 1$. This figure illustrates the idea underlying Theorem 4.1. Taken a point
$(u,v)$ outside the strips, the values of the PBNs of U at $(u - \Omega(\delta), v + \Omega(\delta))$ and
$(u + \Omega(\delta), v - \Omega(\delta))$ are the same. So also the value of the PBNs of X at $(u,v)$ is
determined. Figures 6, 7, 8 depict, in an analogous way, the (obviously much simpler)
PBNs of degree 1.

Figure 1. The circle X of radius 4.

Figure 2. The ball union U.

Figure 3. The representation of $\beta_{(X,f_X,0)}$, the 0-PBNs of X.

Figure 4. The representation of $\beta_{(U,f_U,0)}$, the 0-PBNs of the ball union U.

Figure 5. The blind strips of $\beta_{(U,f_U,0)}$.



Figure 6. The representation of $\beta_{(X,f_X,1)}$, the 1-PBNs of X.

Figure 7. The representation of $\beta_{(U,f_U,1)}$, the 1-PBNs of the ball union U.

Figure 8. The blind strips of $\beta_{(U,f_U,1)}$.

For a second example we have chosen the points $l_j$ not necessarily on X. We have
satisfied the hypothesis of Proposition 4.2 choosing $s = 0.25$ and $\delta = 0.55$. Then,
in order to cover X well, we have chosen a point every $\frac{\pi}{48}$ radians, for a total of 96
points. But this time the points are either 0 or 0.1 or 0.2 away from X. Figure 9
shows the resulting ball union U'. As in the previous case, in the representation of
$\beta_{(U',f_{U'},0)}$ (Figure 10) there is one big triangle showing two connected components
and this time they collapse at value 3.40955. Compared to Figure 4 there are many
more small triangles generated by the asymmetry of the sampling. The width of
the blind strips in Figure 11 is $2\Omega(\delta + s) = 1.6$, so there is still the central triangle.
This means that, although the error in the approximation is much bigger, the blind
strips do not cover the entire figure, leaving the topological information intact at
least in some small areas of $\Delta^+$.



Figure 9. The ball union U'.

Figure 10. The representation of $\beta_{(U',f_{U'},0)}$, the 0-PBNs of the ball union U'.

Figure 11. The blind strips of $\beta_{(U',f_{U'},0)}$.

5. Shape comparison 

So, outside the blind strips, we can get the values of PBNs of the sampled
object X out of its ball covering U (as always, given a filtering function f defined
on the ambient space $\mathbb{R}^m$). This fact gives us the possibility of assessing shape
dissimilarity of two pairs $(X,f)$, $(Y,g)$ although we only know ball coverings U,
W of them, respectively. For this we need the notion of natural pseudodistance,
introduced in [?] and further studied in [?], and its relationship with PBNs.

We recall that, for any two topological spaces X, Y endowed with two continuous
functions $f : X \to \mathbb{R}^n$, $g : Y \to \mathbb{R}^n$, we have the following definition.

Definition. The natural pseudodistance between the pairs $(X,f)$ and $(Y,g)$,
denoted by $\delta((X,f),(Y,g))$, is

(i) the number $\inf_h \max_{x \in X} \|f(x) - g(h(x))\|$ where h varies in the set $H(X,Y)$
of all the homeomorphisms between X and Y, if X and Y are homeomorphic;

(ii) $+\infty$, if X and Y are not homeomorphic.
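The definition can be tried out by brute force in a toy situation of our own choosing, not considered in the paper: finite discrete spaces, where the homeomorphisms are exactly the bijections and two spaces are homeomorphic precisely when they have the same number of points. The sketch below enumerates all bijections, so it is only usable for very small nonempty inputs; the function name is hypothetical.

```python
import math
from itertools import permutations


def natural_pseudodistance_finite(f_vals, g_vals):
    """Brute-force natural pseudodistance between two finite discrete spaces.

    f_vals[i] and g_vals[j] are tuples: the values of the filtering
    functions at the i-th point of X and at the j-th point of Y.
    Distances in the codomain are measured with the max norm.
    """
    if len(f_vals) != len(g_vals):
        return math.inf  # no bijection, hence no homeomorphism
    n = len(f_vals)
    return min(
        max(
            max(abs(a - b) for a, b in zip(f_vals[i], g_vals[h[i]]))
            for i in range(n)
        )
        for h in permutations(range(n))
    )
```

For example, three points with values 0, 1, 3 compared with three points with values 3.5, 0, 1 give 0.5, attained by the bijection that matches 3 with 3.5.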

5.1. Lower bounds for the natural pseudodistance. The following theorem,
which extends Theorem 1 of [10], provides us with a lower bound for the natural
pseudodistance.

Theorem. Let $(X,f)$ and $(Y,g)$ be two pairs. If, for some degree i, $\beta_{(X,f,i)}(u,v) >
\beta_{(Y,g,i)}(u',v')$, then

$\delta((X,f),(Y,g)) \ge \min\left\{ \min_r \{u'_r - u_r\},\ \min_r \{v_r - v'_r\} \right\}.$

Proof. Set $\xi = \min\{\min_r\{u'_r - u_r\}, \min_r\{v_r - v'_r\}\}$. If $\xi \le 0$ then the thesis
trivially holds; so assume $\xi > 0$. Since $\beta_{(X,f,i)}(u,v) > 0$ we have $X\langle f \preceq u\rangle \ne \emptyset$.

Assume that the thesis does not hold, so $\delta((X,f),(Y,g)) < \xi$. Under this
assumption there exists a homeomorphism $h : X \to Y$ such that

$\max_{x \in X} \|f(x) - g(h(x))\| < \xi.$

Set $\Theta(h) = \max_{x \in X} \|f(x) - g(h(x))\|$. So $\Theta(h) < \xi$ and, setting $\zeta = (\xi,\ldots,\xi)$, we
have that h maps the lower level subset $X\langle f \preceq u\rangle$ into $Y\langle g \preceq u + \zeta\rangle \subseteq Y\langle g \preceq u'\rangle$.
Analogously, $h^{-1}$ maps $Y\langle g \preceq v'\rangle$ into $X\langle f \preceq v' + \zeta\rangle \subseteq X\langle f \preceq v\rangle$. Set $X_u =
X\langle f \preceq u\rangle$, $X_{u'} = h^{-1}(Y\langle g \preceq u'\rangle)$, $X_{v'} = h^{-1}(Y\langle g \preceq v'\rangle)$, $X_v = X\langle f \preceq v\rangle$. Then
there are inclusion maps

$\iota_{u,u'} : X_u \to X_{u'}$,  $\iota_{u',v'} : X_{u'} \to X_{v'}$,  $\iota_{v',v} : X_{v'} \to X_v$

and $\iota_{u,v} : X_u \to X_v$, with $\iota_{u,v} = \iota_{v',v} \circ \iota_{u',v'} \circ \iota_{u,u'}$.

The pairs $(Y\langle g \preceq v'\rangle, Y\langle g \preceq u'\rangle)$ and $(X_{v'}, X_{u'})$ are homeomorphic, so they can
be interchanged in what follows. Our claim is that for each degree i the dimension
of the image of the homomorphism induced in homology by $\iota_{u,v}$ is less than or
equal to the one of the image of the homomorphism induced by $\iota_{u',v'}$. First, note
that all dimensions considered here are finite (Theorem 2.3 of [3]). Then, each
inclusion map $\iota$ induces a homology homomorphism $\iota_*$, and $(\iota_{u,v})_* = (\iota_{v',v})_* \circ
(\iota_{u',v'})_* \circ (\iota_{u,u'})_*$. But then $\dim \mathrm{Im}(\iota_{u,v})_* \le \dim \mathrm{Im}(\iota_{u',v'})_*$, so $\beta_{(X,f,i)}(u,v) \le
\dim \mathrm{Im}(\iota_{u',v'})_* = \beta_{(Y,g,i)}(u',v')$, against the hypothesis. □
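The lower bound of the theorem is a plain componentwise computation. The sketch below evaluates it for points given as tuples; the function name `pseudodistance_lower_bound` is ours, and the sample values are the ones of the worked example in Section 5.2, where the pair with the larger PBN plays the role of $(X,f)$ in the theorem.

```python
def pseudodistance_lower_bound(u, v, u_prime, v_prime):
    """Lower bound given by the theorem when the PBN of the first pair at
    (u, v) is strictly greater than the PBN of the second pair at
    (u_prime, v_prime): the smallest of all the componentwise gaps
    u'_r - u_r and v_r - v'_r."""
    gap_u = min(b - a for a, b in zip(u, u_prime))
    gap_v = min(a - b for a, b in zip(v, v_prime))
    return min(gap_u, gap_v)


# One-dimensional filtering functions are passed as 1-tuples.
bound_coarse = pseudodistance_lower_bound((0.4,), (2.2,), (1.1,), (1.5,))
bound_fine = pseudodistance_lower_bound((0.2,), (2.6,), (1.3,), (1.5,))
```

Up to floating point rounding, the two calls reproduce the bounds 0.7 and 1.1 quoted in Section 5.2.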

5.2. An example of comparison of sampled shapes. When only finite, dense
enough samples (or, equivalently, ball coverings) of two objects are available, if
there is a nonempty intersection of the complements of the blind strips, we can still
assess the natural pseudodistance between them.

This is the case of the following example: we obtain a lower bound for $\delta((X,f),(Y,g))$,
where X is the circle of Figure 12 and Y is the bean-shaped curve of Figure 13.
For both spaces the filtering functions f and g are the restrictions of the absolute
value of the ordinate. The 0-PBNs for $(X,f)$ and $(Y,g)$ are depicted in Figures 14 and
15 respectively.

Figure 12. A circle X covered with balls of radius 0.4.

Figure 13. A different curve Y covered with balls of radius 0.4.

Figure 14. The 0-PBNs of the ball covering of X, with blind strips.

Figure 15. The 0-PBNs of the ball covering of Y, with blind strips.


There is a triangle, not covered by blind strips in both diagrams, where the PBN
for X is 2 and for Y is 3 by Theorem 4.1. So we have

$\beta_{(Y,g,0)}(0.4, 2.2) = 3 > 2 = \beta_{(X,f,0)}(1.1, 1.5)$

and, by Theorem 5.1, $\delta((X,f),(Y,g)) \ge 0.7$.

If we made use of smaller (but denser) balls of radius 0.2, then we would have

$\beta_{(Y,g,0)}(0.2, 2.6) = 3 > 2 = \beta_{(X,f,0)}(1.3, 1.5)$

and $\delta((X,f),(Y,g)) \ge 1.1$. (The true natural pseudodistance of the two pairs can
be computed directly from the definition, and equals 1.5.)

6. A combinatorial representation

The ball unions of Section 4, although generated by finite sets, are still continuous
objects. It is desirable that the topological information on X, up to a certain
approximation, be condensed in a combinatorial object. For size functions (i.e.
for PBNs of degree 0) it was a graph; here, it has to be a simplicial complex. We
shall build such a complex by following [13], to which we refer for all definitions not
reported here. Please note that [13] uses weighted Voronoi cells and diagrams, while
we do not need to worry about that, since all of our balls have the same radius;
so the customary Euclidean distance can be used instead of the power distance
employed in that paper.

Let X, $L = \{l_1,\ldots,l_k\}$ and $\delta$ be as in Section 4.1 (the case of Section 4.2 is
an immediate extension). Moreover, let the points of L be in general position.
For each $l_j \in L$, let $B_j = B(l_j,\delta)$ be the ball of radius $\delta$, centered at $l_j$. The
set $\mathcal{B} = \{B_1,\ldots,B_k\}$ is a ball covering of X; denote by U the corresponding ball
union. Let now $V_j$ be the Voronoi cell of $B_j$, i.e. the set of points of $\mathbb{R}^m$ whose
distance from $l_j$ is not greater than the distance from any other $l_{j'}$.

The set $\mathcal{V} = \{V_1,\ldots,V_k\}$ is the Voronoi diagram of $\mathcal{B}$. From $\mathcal{V}$ we get the
collection of cells $\mathcal{Q} = \{\hat{V}_j = V_j \cap B_j \mid j = 1,\ldots,k\}$, a decomposition of U.

The nerve $N(\mathcal{Q})$ of $\mathcal{Q}$ is the abstract simplicial complex where vertices are the
elements of $\mathcal{Q}$ and, for a subset T of $\{1,\ldots,k\}$, the set of vertices $\{\hat{V}_j \mid j \in T\}$ is a
simplex if and only if $\bigcap_{j \in T} \hat{V}_j \ne \emptyset$.

For any $T \subseteq \{1,\ldots,k\}$, $T \ne \emptyset$, we denote by $\sigma_T$ the convex hull of $\{l_j \mid j \in T\}$.

The dual complex of $\mathcal{Q}$ is $\mathcal{K} = \{\sigma_T \mid \{\hat{V}_j \mid j \in T\} \in N(\mathcal{Q})\}$ and $S = |\mathcal{K}|$, union of
the simplices of $\mathcal{K}$, is the dual shape of U.
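The nerve construction itself is purely combinatorial and can be sketched independently of the geometry. In the minimal example below the cells are assumed to be given as finite sets (for instance, sets of grid points obtained by discretizing the clipped Voronoi cells); this discretization is our simplification and not the exact construction of [13]. The function name `nerve` and the parameter `max_dim` are hypothetical.

```python
from itertools import combinations


def nerve(cells, max_dim=2):
    """Simplices of the nerve of a finite family of finite sets.

    Each simplex is returned as a tuple of indices T such that the cells
    indexed by T have a common element. Only simplices of dimension up to
    max_dim are listed.
    """
    simplices = []
    for size in range(1, max_dim + 2):
        for T in combinations(range(len(cells)), size):
            common = set.intersection(*(set(cells[j]) for j in T))
            if common:
                simplices.append(T)
    return simplices


# Three cells in a row: consecutive cells overlap, the outer two do not.
toy_cells = [{1, 2}, {2, 3}, {3, 4}]
toy_nerve = nerve(toy_cells)
```

The resulting complex has three vertices and two edges, i.e. it is a path; replacing each index by the corresponding center and taking convex hulls would give the dual complex in the sense defined above.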

For a better understanding of the previous part we produce a toy example. Let
X be a quarter of circle of radius 4 and U be the union of nine balls of radius 1,
with centers near X (Figure 16). The Voronoi diagram $\mathcal{V}$ associated to this ball
covering $\mathcal{B}$ is depicted in Figure 17.



Figure 16. A quarter of circle of radius 4 covered by nine balls of radius 1.

Figure 17. The Voronoi diagram $\mathcal{V}$ of $\mathcal{B}$.


Now the main idea is that we can associate the dual complex $\mathcal{K}$ with the
submanifold X. In fact, by Theorem 3.2 of [13], its space S is homotopically equivalent
to U and, by transitivity, to X. Moreover, Section 3 of [13] explicitly builds a
retraction $\rho$ from U to S and a homotopy H from the identity of U to $\rho$, such that
$\forall y \in S$, $\forall v \in \rho^{-1}(y)$, $\forall t \in I$ we have $(\rho \circ H)(v,t) = y$. For a complete description
of the homotopy H and the retraction $\rho$ we refer to the original article.

$\mathcal{K}$ and S are shown in Figures 18, 19 respectively.

6.1. Ball union and dual shape. Let $f : \mathbb{R}^m \to \mathbb{R}^n$ be a continuous function
and let $f_S$ and $f_U$ be the restrictions of f to S and U respectively. The notation
for the PBNs $\beta$ is simplified as before.

Figure 18. The dual complex $\mathcal{K}$.

Figure 19. The dual shape S.


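Lemma. If $(u,v)$ is a point of $\Delta^+$ and if $u + \omega(\delta) \prec v - \omega(\delta)$, where $\omega(\delta) =
(\Omega(\delta),\ldots,\Omega(\delta)) \in \mathbb{R}^n$, then

$\beta_U(u - \omega(\delta), v + \omega(\delta)) \le \beta_S(u,v) \le \beta_U(u + \omega(\delta), v - \omega(\delta)).$

If $\hat{u}, \hat{v} \in \mathbb{R}^n$ are such that

$\beta_U(\hat{u},\hat{v}) \ne \beta_S(\hat{u},\hat{v})$

then there is at least a point $(u,v)$ with max norm $\|(u,v) - (\hat{u},\hat{v})\| \le \Omega(\delta)$, which
is either an element of the boundary of $\Delta^+$ or a discontinuity point of $\beta_U$.

Proof. By the lemmas of Section 3 (the double inequality and the localization of
the points of disagreement), with Y = S, V = U. □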


Now we can get an estimate of the PBNs of X from the ones of S. The blind
strips of the 1D reduction will be doubly wide, with respect to the ones previously
considered. Still, this can leave some regions of $\Delta^+$ where the computation is exact.
Also here the position of the blind strips is well determined by the position of the
discontinuity lines of the PBNs of S.

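Theorem. If $(u,v)$ is a point of $\Delta^+$ and if $u + 2\omega(\delta) \prec v - 2\omega(\delta)$, where $\omega(\delta) =
(\Omega(\delta),\ldots,\Omega(\delta)) \in \mathbb{R}^n$, then

$\beta_S(u - 2\omega(\delta), v + 2\omega(\delta)) \le \beta_X(u,v) \le \beta_S(u + 2\omega(\delta), v - 2\omega(\delta)).$

If $\hat{u}, \hat{v} \in \mathbb{R}^n$ are such that

$\beta_X(\hat{u},\hat{v}) \ne \beta_S(\hat{u},\hat{v})$

then there is at least a point $(u,v)$ with max norm $\|(u,v) - (\hat{u},\hat{v})\| \le 2\Omega(\delta)$, which
is either an element of the boundary of $\Delta^+$ or a discontinuity point of $\beta_S$.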


Proof. By Theorem 4.1 and with $\omega = \omega(\delta)$,

$\beta_U(u - \omega, v + \omega) \le \beta_X(u,v) \le \beta_U(u + \omega, v - \omega).$

Then we have

$\beta_U(u + \omega, v - \omega) \le \beta_S(u + 2\omega, v - 2\omega)$

by Lemma 6.1, by substituting $(u,v)$ with $(u + 2\omega, v - 2\omega)$, and

$\beta_S(u - 2\omega, v + 2\omega) \le \beta_U(u - \omega, v + \omega)$

by Lemma 6.1, by substituting $(u,v)$ with $(u - 2\omega, v + 2\omega)$.

The second part can be proved in an analogous way from the second part of
Lemma 6.1. □

6.2. An example in 2D persistence. As a simple example, we now apply
Theorems 5.1 and 6.1 to two pairs $(X,f)$ and $(Y,g)$ with 2-dimensional filtering functions.
Let X and Y be two circles of radius r embedded in $\mathbb{R}^m$ and f, g be (unknown)
continuous functions from $\mathbb{R}^m$ to $\mathbb{R}^2$, whose restrictions to suitable neighbourhoods of
X, Y have a given modulus of continuity $\Omega(\delta)$. Here the range $\mathbb{R}^2$ parametrizes the
plane H of equation $R + G + B = c$ (with the constant c chosen so that H contains a fairly
large square of points with nonnegative coordinates) of the color space RGB. A
Cartesian reference frame x, y, z has been fixed in the RGB space, so that H is the
z = 0 plane. The change of reference is:



Of the functions on the two circles we know a sampling given by 80 regularly spaced
points, represented in Figure 20.

Figure 20. The color samplings of $(X,f)$ (left) and $(Y,g)$ (right).


We can think of the 80 points on each circle as centers of disks of radius $\delta = 0.08r$.
The distance between two consecutive points is $2r\sin(\frac{\pi}{80}) \in\ ]0.078r, 0.079r[$, so the
balls form a covering which respects the hypotheses of Proposition 2.2, since $\tau = r$
and $0.08 < \sqrt{3/5}$. The corresponding dual complex is a 1D cycle and the dual shape
is a closed polygonal line.

The point images in the color plane H are along the edges of the square of
vertices (−0.4, 0), (−0.4, 0.8), (0.4, 0.8), (0.4, 0) for $(X,f)$ and along the polygonal line
of vertices (−0.4, 0.04), (0.36, 0.8), (0.4, 0.8), (0.4, 0.76), (−0.36, 0) (twice) for $(Y,g)$,
two consecutive ones at distance $0.04 = \Omega(\delta)$ in the max norm.

A 1D reduction of the PBNs is possible through a foliation of the domain $\mathbb{R}^2 \times \mathbb{R}^2$.
In the terminology of [?], a suitable admissible pair $\vec{l}, \vec{b}$ is given by $\vec{l} = (\frac{\sqrt{2}}{2}, \frac{\sqrt{2}}{2})$
and $\vec{b} = (-0.2, 0.2)$; in the corresponding leaf of the foliation, the PBNs of the dual
shapes and the blind strips according to Theorem 6.1 are as shown in Figure 21.

In correspondence of points P, Q in the leaf, we get $(u,v), (u',v') \in \mathbb{R}^2 \times \mathbb{R}^2$
respectively, with $u = (-0.28, 0.12)$, $v = (0.32, 0.72)$, $u' = (-0.06, 0.34)$, $v' =
(0.1, 0.5)$. Then we have

$\beta_{(Y,g,0)}(u,v) = 2 > 1 = \beta_{(X,f,0)}(u',v')$

and, by Theorem 5.1, $\delta((X,f),(Y,g)) \ge 0.22$.
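The numbers of this example can be checked with a short script. The sketch assumes that the direction of the admissible pair is the unit vector with equal components, (sqrt(2)/2, sqrt(2)/2), and that a leaf consists of the pairs of points of the line t * l + b; both are our reading of the 1D reduction and are labeled as assumptions. The function names are hypothetical.

```python
import math

L_DIR = (math.sqrt(2) / 2, math.sqrt(2) / 2)  # assumed direction of the pair
B_OFF = (-0.2, 0.2)                           # offset of the pair, from the text


def leaf_parameter(point, l=L_DIR, b=B_OFF, tol=1e-9):
    """Return t such that point = t * l + b, or None if the point does not
    lie on the line spanned by l through b."""
    t_vals = [(p - bj) / lj for p, bj, lj in zip(point, b, l)]
    return t_vals[0] if max(t_vals) - min(t_vals) <= tol else None


def lower_bound(u, v, u_prime, v_prime):
    """Bound of Theorem 5.1: smallest componentwise gap."""
    return min(
        min(b - a for a, b in zip(u, u_prime)),
        min(a - b for a, b in zip(v, v_prime)),
    )


u, v = (-0.28, 0.12), (0.32, 0.72)
u_prime, v_prime = (-0.06, 0.34), (0.1, 0.5)
```

All four points turn out to lie on the assumed line, and the bound evaluates to 0.22 up to floating point rounding.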
Figure 21. PBNs of the two colored circles.